Biomedical Named Entity Recognition at Scale
نویسندگان
چکیده
Named entity recognition (NER) is a widely applicable natural language processing task and building block of question answering, topic modeling, information retrieval, etc. In the medical domain, NER plays crucial role by extracting meaningful chunks from clinical notes reports, which are then fed to downstream tasks like assertion status detection, resolution, relation extraction, de-identification. Reimplementing Bi-LSTM-CNN-Char deep learning architecture on top Apache Spark, we present single trainable model that obtains new state-of-the-art results seven public biomedical benchmarks without using heavy contextual embeddings BERT. This includes improving BC4CHEMD 93.72% (4.1% gain), Species800 80.91% (4.6% JNLPBA 81.29% (5.2% gain). addition, this freely available within production-grade code base as part open-source Spark NLP library; can scale up for training inference in any cluster; has GPU support libraries popular programming languages such Python, R, Scala Java; be extended other human with no changes.
منابع مشابه
Biomedical Named Entity Recognition System
We propose a machine learning approach, using a Maximum Entropy (ME) model to construct a Named Entity Recognition (NER) classifier to retrieve biomedical names from texts. In experiments, we utilize a blend of various linguistic features incorporated into the ME model to assign class labels and location within an entity sequence, and a postprocessing strategy for corrections to sequences of ta...
متن کاملAt-least-N voting over biomedical named entity recognition systems
Biomedical named entity recognition (BNER) has been actively studied over the years, and several BNER systems have become publicly available. In this study, we investigate the utility of a simple voting method called at-least-n voting to improve gene name recognition, which takes advantage of the availability of BNER systems in the domain. We found this voting scheme is effective in combining B...
متن کاملBiomedical Named Entity Recognition Using Neural Networks
We investigate the task of Named Entity Recognition (NER) in the domain of biomedical text. There is little published work employing modern neural network techniques in this domain, probably due to the small sizes of human-labeled data sets, as non-trivial neural models would have great difficulty avoiding overfitting. In this work we follow a semi-supervised learning approach: We first train s...
متن کاملReranking for Biomedical Named-Entity Recognition
This paper investigates improvement of automatic biomedical named-entity recognition by applying a reranking method to the COLING 2004 JNLPBA shared task of bioentity recognition. Our system has a common reranking architecture that consists of a pipeline of two statistical classifiers which are based on log-linear models. The architecture enables the reranker to take advantage of features which...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-68763-2_48